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Introduction 

Although  it  is  apparent  that  people  are  able  to  make  inferences  from  graphs,  it  is 
presently  unclear  how  they  do  so,  even  from  simple  graphs.  Current  theories  of  graph 
comprehension  are  largely  silent  about  the  processes  by  which  such  inferences  are 
made  (  e.g.,  Freedman  &  Shah,  2002;  Pinker,  1990).  We  propose  that  people  use 
spatial  reasoning,  in  the  form  of  spatial  transformations  (Trafton,  Trickett,  &  Mintz, 
in  press),  to  answer  inferential  questions.  Spatial  transformations  are  cognitive 
operations  that  a  person  performs  on  internal  or  external  visualizations,  such  as 
graphs.  They  occur  when  people  must  mentally  create  or  delete  something  (e.g.,  a 
line)  on  the  image  in  order  to  facilitate  problem  solving,  and  may  be  related  to 
hypothetical  drawing  (Shimojima  &  Fukaya,  2003).  This  paper  investigates  the  use  of 
spatial  transformations  when  people  need  to  make  inferences  from  graphs. 

Method 

Eight  GMU  undergraduates  participated  for  course  credit.  Participants  were  shown  30 
unlabelled  line  graphs  and  asked  for  the  value  of  the  y  axis  at  a  given  point  on  the  x 
axis.  This  point  on  the  x  axis  was  indicated  by  a  red  arrow  in  one  of  three  different 
positions,  creating  three  conditions:  read-off  (arrow  beneath  line),  near  (arrow  slightly 
beyond  line),  and  far  (far  beyond  end  of  line)  (see  Figure  1  for  examples).  Participants 
were  shown  10  of  each  graph/condition  combination,  in  random  order;  performance 
was  self-paced,  with  a  blank  screen  appearing  between  each  graph. 

To  answer  the  question  in  the  read-off  condition  participants  had  to  move  their 
eyes  in  perpendicular  fashion  from  the  red  arrow  to  intersect  the  line,  move  their  eyes 
from  that  intersection  to  the  y  axis,  and  read  off  the  appropriate  value.  No  spatial 
transformations  were  required  for  this  task.  To  answer  the  question  in  the  near  and  far 
conditions,  we  propose  that  participants  would  mentally  extend  the  line  prior  to 
locating  its  intersection  with  the  perpendicular  from  the  red  arrow  and  completing  the 
task.  The  extension — a  mental  manipulation — constitutes  a  spatial  transformation. 
Spatial  transformation  theory  predicts  that  the  longer  the  extension,  the  longer  it 
takes;  thus,  we  predict  that  participants  will  be  fastest  in  the  read-off  (no  extension) 
condition,  somewhat  slower  in  the  near  (shorter  extension)  condition,  and  slowest  in 
the  far  (longer  extension)  condition.  We  also  predict  that  accuracy  will  decrease  with 


A.  Blackwell  et  al.  (Eds.):  Diagrams  2004,  LNAI 2980,  pp.  372-375,  2004. 
©  Springer- Verlag  Berlin  Heidelberg  2004 


Report  Documentation  Page 

Form  Approved 

OMB  No.  0704-0188 

Public  reporting  burden  for  the  collection  of  information  is  estimated  to  average  1  hour  per  response,  including  the  time  for  reviewing  instructions,  searching  existing  data  sources,  gathering  and 
maintaining  the  data  needed,  and  completing  and  reviewing  the  collection  of  information.  Send  comments  regarding  this  burden  estimate  or  any  other  aspect  of  this  collection  of  information, 
including  suggestions  for  reducing  this  burden,  to  Washington  Headquarters  Services,  Directorate  for  Information  Operations  and  Reports,  1215  Jefferson  Davis  Highway,  Suite  1204,  Arlington 

VA  22202-4302.  Respondents  should  be  aware  that  notwithstanding  any  other  provision  of  law,  no  person  shall  be  subject  to  a  penalty  for  failing  to  comply  with  a  collection  of  information  if  it 
does  not  display  a  currently  valid  OMB  control  number. 

1.  REPORT  DATE 

20Q4  2.  REPORT  TYPE 

3.  DATES  COVERED 

00-00-2004  to  00-00-2004 

4.  TITLE  AND  SUBTITLE 

Spatial  Transformations  in  Graph  Comprehension 

5a.  CONTRACT  NUMBER 

5b.  GRANT  NUMBER 

5c.  PROGRAM  ELEMENT  NUMBER 

6.  AUTHOR(S) 

5d.  PROJECT  NUMBER 

5e.  TASK  NUMBER 

5f.  WORK  UNIT  NUMBER 

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Naval  Research  Laboratory, Navy  Center  for  Applied  Research  in 

Artificial  Intelligence  (NCARAI),4555  Overlook  Avenue 

SW, Washington, DC, 20375 

8.  PERFORMING  ORGANIZATION 

REPORT  NUMBER 

9.  SPONSORING/MONITORING  AGENCY  NAME(S)  AND  ADDRESS (ES) 

10.  SPONSOR/MONITOR’S  ACRONYM(S) 

11.  SPONSOR/MONITOR’S  REPORT 
NUMBER(S) 

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 

14.  ABSTRACT 

15.  SUBJECT  TERMS 

16.  SECURITY  CLASSIFICATION  OF:  17.  LIMITATION  OF 

ARSTRATT 

1 8 .  NUMBER  1 9a.  NAME  OF 

OF  PAGES  RESPONSIBLE  PERSON 

a.  REPORT  b.  ABSTRACT  c.  THIS  PAGE  Same  aS 

unclassified  unclassified  unclassified  Report  (SAR) 

4 

Standard  Form  298  (Rev.  8-98) 

Prescribed  by  ANSI  Std  Z39-18 


Spatial  Transformations  in  Graph  Comprehension  373 


Fig.  1.  Line  graph  read-off  condition  (left)  and  far  condition  (right).  In  the  near  condition  (not 
illustrated)  the  arrow  was  1 5  units  to  the  right  of  the  end  of  the  line,  along  the  x  axis. 


increased  use  of  spatial  transformations,  as  people  must  move  further  from  “anchor 
points”  on  the  graph  to  obtain  needed  information.  Thus,  we  predict  that  participants 
will  be  most  accurate  in  the  read-off  condition,  somewhat  less  accurate  in  the  near 
condition,  and  least  accurate  in  the  far  condition. 

Results  and  Discussion 

We  measured  accuracy  as  the  absolute  value  of  the  participant's  response  minus  the 
correct  response.  A  score  of  0  thus  means  the  answer  was  completely  accurate; 
increased  scores  represent  a  decrease  in  accuracy.  Response  times  (RT)  represent  the 
amount  of  time  between  graph  presentation  and  entry  of  the  participant's  response. 
Responses  with  an  accuracy  score  of  more  than  100  were  considered  outliers  and 
excluded  from  analysis,  as  were  responses  whose  RT  was  greater  than  3  standard 
deviations  above  the  mean.  Outliers  constituted  less  than  5%  of  the  data. 


Fig.  2a  (left).  Mean  accuracy  scores  for  the  read-off,  near,  and  far  tasks 

Fig.  2b  (right).  Mean  response  times  in  seconds  for  the  read-off,  near,  and  far  tasks 
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As  Fig.  2a  shows,  participants  were  most  accurate  on  the  read-off  task,  less 
accurate  on  the  near  task,  and  least  accurate  on  the  far  task,  repeated  measures 
ANOVA  F( 2,  14)  =  15.46,/?  <  .01,  linear  trend  F(  1,  7)  =  18.98,/?  <  .01.  These  results 
are  consistent  with  our  hypothesis  that  participants  use  spatial  transformations  to 
execute  the  inference  tasks.  The  read-off  task  required  no  spatial  transformations,  but 
as  participants  mentally  extended  the  line  to  find  a  hypothetical  point  of  intersection 
with  the  arrow,  the  point  of  intersection  became  increasingly  distant  from  the 
“anchor”  of  the  y  axis.  Participants  were  decreasingly  accurate  as  a  result. 

It  is  possible  that  participants'  engaged  in  a  speed-accuracy  tradefoff;  however,  the 
response  time  data  indicate  that  they  became  slower  as  they  became  less  accurate.  The 
response  time  data  also  point  to  the  use  of  spatial  transformations.  Participants  were 
fastest  on  the  read-off  task,  slower  in  the  near  task,  and  slowest  on  the  far  task,  F( 2, 
14)  =  4.93,/?  <  .05,  linear  trend  F(  1,  7)  =  6.7,/?  <  .05  (see  Figure  2b).  The  linear  trend 
is  consistent  with  the  idea  that  a  longer  extension  takes  more  time  to  execute  than  a 
shorter  one.  If  this  is  true,  it  should  take  a  measurable  amount  of  time  more  for  each 
extension.  In  order  to  calculate  how  long  each  extra  extension  took,  we  did  a  multiple 
linear  regression,  using  the  distance  along  the  x  axis  participants  had  to  extend  the 
line.  This  analysis  was  significant,  r  =  .41,/?  <  .01.  The  analysis  yielded  the  following 
formula:  Response  Time  =  8.21  +  1.28(1.2),  where  8.21  seconds  is  the  baseline  time 
to  read  information  from  the  graph  and  1.28  is  the  amount  of  extra  time  required  to 
extend  the  line  each  1 .2  cm  distance  required.  This  result  supports  our  hypothesis  that 
participants  used  spatial  transformations,  by  indicating  a  systematic  relationship 
between  response  time  and  the  distance  mentally  traveled.  As  participants  had  to  draw 
longer  mental  extensions  to  the  graph,  their  response  times  systematically  increased. 

Finally,  participants'  self-reports  indicate  that  they  used  a  spatial  strategy.  After 
participants  completed  the  tasks,  we  interviewed  them  about  their  strategies  for 
performing  each  task.  Participants  unanimously  said  something  like  “I  went  straight 
up  and  over”  in  reporting  how  they  performed  the  read-off  task.  For  the  near  and  far 
tasks,  they  all  reported  some  variation  on  extending  the  line.  Typical  responses 
included,  “I  imagined  where  the  line  would  go,”  “I  estimated  how  far  you  think — ,  the 
angle  the  line  is  going  to  go,”  “You  have  to  project  the  line  with  your  eye  and  then  go 
from  the  arrow  to  the  middle  and  over  to  the  y  axis.”  These  comments  indicate  that 
participants  relied  on  the  spatial  characteristics  of  the  graph  to  make  inferences. 

In  summary,  we  have  found  that  when  people  make  inferences  about  simple 
graphs,  their  accuracy,  response  time,  and  self-report  data  suggest  that  they  use  spatial 
reasoning,  in  the  form  of  spatial  transformations.  Given  that  current  theories  of  graph 
comprehension  provide  no  account  of  mechanisms  by  which  people  make  such 
inferences,  we  propose  that  a  comprehensive  theory  of  graph  comprehension  should 
accommodate  spatial  reasoning,  as  indicated  by  these  data. 
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